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INTRODUCTION  TO  RELATIONAL  PROGRAMMING* 

B.  J.  MacLennan.  81/02/24. 

Naval  Postgraduate  School 
Monterey,  CA  93940 

1 .  Introduction 

In  this  paper  we  discuss  relational  programming ,  i.e.  a 
style  or  programming  in  which  entire  relations  are  manipulated 
rather  than  individual  data.  This  is  analogous  to  functional 
programming,  wherein  entire  functions  are  the  values  manipulated 
by  the  operators.  We  will  see  that  relational  programming  sub¬ 
sumes  functional  programming  because  every  function  is  also  a 
relation.  It  is  appropriate  at  this  point  to  discuss  why  we  have 
chosen  to  investigate  relational  programming. 

As  we  have  noted,  relational  programming  subsumes  functional 
programming;  hence,  anything  that  can  be  done  with  functional 
programming  can  be  done  with  relational  programming.  Further¬ 
more,  relational  programming  has  many  of  the  advantages  of  func¬ 
tional  programming:  for  instance,  the  ability  to  derive  and 
manipulate  programs  by  algebraic  manipulation.  A  well  developed 
algebra  of  relations  dates  back  to  Boole's  original  work  and  has 
been  extensively  studied  since  then.  Although  relations  are  more 

*  The  work  reported  herein  was  supported  by  the  Foundation 
Research  Program  of  the  Naval  Postgraduate  School  with  funds 
provided  by  the  Chief  of  Naval  Research. 
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general  chan  cunctions,  their  laws  are  orten  simpler.  For 
Instance,  (tg)**^  =  g“^t“^  Is  true  tor  all  relations,  but  true 
only  for  functions  that  are  one-to-one.  Also,  relational  pro¬ 
gramming  more  directly  supports  non-linear  data  structures,  such 
as  trees  and  graphs,  than  does  functional  programming.  In  rela¬ 
tional  programming  the  basic  data  values  are  themselves  rela¬ 
tions,  whereas  in  functional  programming  there  is  a  separate 
class  of  objects  (lists)  used  for  data  structures.  One  final 
reason  for  investigating  relational  programming  is  that  it  pro¬ 
vides  a  possible  paradigm  for  utilizing  associative  and  active 
memories.  As  a  teaser  for  what  is  to  come,  we  present  the  fol¬ 
lowing  example  of  a  relational  program.  We  will  take  a  text  T, 
represented  as  an  array  or  words  (i.e.,  T:i  is  the  i-th  word), 
and  generate  a  frequency  table  F  so  that  F:w  is  the  number  of 
occurences  of  word  w  in  T.  Now  we  will  see  (section  3)  that  T:w 
is  the  set  of  all  indices  of  the  word  w.  If  we  let  #:C  be  the 
cardinality  of  a  class,  then  the  number  of  indices  (occurences) 
or  w  is  just  #;(T:w).  Therefore  we  can  write  F  =  #T  (section  6). 
For  a  second  example,  we  will  see  in  section  13  that  a  program  to 
update  a  payroll  file  i  can  be  written: 

i'  “  Md^  where  u  “  (,H)(+) 

2.  Classes  and  Relations 

As  is  usual  we  will  use  xRy  to  mean  that  x  bears  the  tela- 


tion  R  to  y.  Similarly,  we  will  write  xC  oi  xSC  to  mean  x  is  in 
the  class  C  (i.e.  x  has  the  property  C) .  Our  theory  ot  relations 
will  be  typeless,  like  that  described  in  [6];  this  seems  mote 
appropriate  to  programming  than  systems  such  as  Russell's  "rami- 
tied  type  theory."  In  most  other  respects  our  notation  follows 
that  of  Carnap  [1]  and  Whitehead  and  Russell  [8].  There  is  no 
special  significance  to  the  case  of  variables,  although  we  will 
often  distinguish  relations  (and  classes)  from  the  things  they 
relate  by  putting  them  in  upper  case. 

We  often  need  to  talk  of  the  individuals  that  can  occur  on 
the  tight  ot  left  of  a  relation.  We  say  that  x  is  a  left-member 
ot  R  whenever  there  is  a  y  such  that  xRy. 

X  Lro  R  3-y(xRy) 

Right-member  and  member  (Rm  and  Mm)  ate  defined  analogously. 

3.  Functions 

We  define  functions  as  special  classes  of  relations:  h  rela¬ 
tion  F  is  a  function  if  and  only  if  it  is  left-univalent: 

Felun  4— »  Vxyz[  yFx  A  zFx  y*z  ] 

If  F  is  left-univalent  then  we  can  write  F:x  for  the  unique  y 
such  that  yFx  (if  such  a  y  exists).  This  differs  from  the  usual 
convention,  in  which  y*F:x  means  xFy,  but  agrees  with  [8]  and 


works  better  with  the  rest  of  the  notation.  Right-univalent  and 
bi-univalent  relations  (run  and  bun)  are  defined  analogously. 


The  fact  that  F:x  may  be  meaningless  makes  it  convenient  to 
use  several  other  relations  derived  from  F.  One  of  these  is  the 
plural  descr iption.  If  F  is  any  relation  and  C  is  a  class  then 
F!:C  is  the  set  of  all  y  such  that  yFx  for  some  x  in  C,  i.e., 

y  6  Fl;C  3>x(yFx  A  xC) 

Notice  that  the  operation  F! ;C  is  defined  for  all  relations  F  and 
classes  C,  regardless  of  whether  FSlun  or  the  members  of  C  are 
right  members  of  F. 

Related  ideas  are  the  image  and  converse  image  of  an  indivi¬ 
dual.  If  R  is  a  relation,  then  cRx  means  that  c  is  the  class  of 
individuals  related  to  x.  This  class  is  called  the  image  of  x, 
and  is  defined: 

y  e  R:x  »  yRx 

The  analogous  idea  is  that  of  the  converse  image  of  y: 

X  e  R;y  »  yRx 

Like  the  plural  description,  R  and  R  are  defined  for  all  R  and 
all  arguments.  It  is  generally  safer  to  use  f  than  f  since  f:x 
may  be  undefined  whereas  f:x  is  always  defined. 
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4.  Combining  Relations 


We  will  next  investigate  ways  ot  combining  relations  and 
classes.  The  simplest  methods  ate  just  abstractions  ot  the  logi¬ 
cal  connectives  used  between  propositions:  intersection,  union, 
negation  and  diiterence  (A»  V,  -) .  For  instance  R  V  S  is  defined 
so  that: 


x(R  V  S)y  xRy  V  xSy 


As  an  example  of  the  use  of  these  operations,  the  detinition  ot 
Mm  can  be  written: 


Mm  =  Lm  V  Rm 


The  logical  connectives  satisfy  the  usual  properties  of  a  Boolean 
algebra  (e.g.,  DeMotgan's  theorem). 

We  will  also  define  the  converse  of  a  relation.  The  rela¬ 
tion  R”'*’  is  called  the  converse  ot  R,  i.e.  xR”^y  4— »  yRx.  If  f 
is  a  function  then  t”^  is  the  inverse  of  f. 


5.  Limiting  and  Restriction 


It  is  often  useful  to  limit  the  left  or  right  domain  of  a 
relation.  Consider  the  relation  x  sin”^  y,  which  means  that  x  is 
an  arcsine  of  y.  We  cannot  write  x  *  sin”^:y  because  sin“^  is 
not  left  univalent  (i.e.  it  is  not  a  function).  We  can  solve 
this  problem  by  defining  a  function  Sin  whose  arguments  are 


-  6  - 


testticted  to  be  in  the  range  -ii/2  to  ii/2.  Let  S  be  the  class  ot 
teals  in  this  range: 


S  =  (>:-»/2)  A  (<:w/2) 


then  we  will  write  sin>S  for  the  sine  tunction  with  its  arguments 
restricted  to  S,  which  is  exactly  the  Sin  tunction  we  sought: 


Sin  =  sin> t (>:-• f/2)  A  (<:w/2)] 


This  function  is  bi-univalent,  so  it  is  invertible.  It  is  now 
perfectly  meaningful  to  write  Sin“^:y  (if  y  Lm  sin).  The  right- 
restriction  operation  is  defined: 


x(R>S)y  xRy  A  yS 


The  left-restriction,  S<R,  is  defined  analogously.  These  nota¬ 
tions  can  be  combined  to  restrict  both  domains:  S<R>T.  The  com¬ 
bination  S<R>S  is  so  common  that  a  special  restriction  notation 
is  provided  for  it: 


rSs  =  s<r>s 


For  instance,  predS(>:0),  is  the  predecessor  relation  restricted 
to  positive  numbers. 


6.  Relative  Product  and  Composition 


If  R  is  the  relation  "...  is  a  son  of  ..."  and  S  is  the 
relation  "...  is  a  brother  of  ...",  then  the  relative  product , 
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RlS,  is  the  relation  "...  is  a  son  ot  a  brother  or  ...".  More 
rormally , 

x{r|s)2  3-y(xRy  A  ySz) 

Where  there  is  little  chance  of  confusion,  we  will  write  RS  for 
RlS.  If  f  and  g  are  functions  it  is  easy  to  see  that  fig  is  the 
composition  of  these  functions.  Hence,  fg:x  =  f:(g;x). 

It  is  convenient  to  have  a  notation  for  relative  products  of 
a  relation  with  itself.  For  instance,  the  "grandparent"  relation 
can  be  written  par ent I  par ent ,  which  we  abbreviate  patent  .  In 
gener  al , 

R^  =  R 

rH+I  a  (R'^)R  a  RIr'^) 

R-n  _  (rO)-!  (R-i)^ 

R°  =  =«(^te»:R) 

It  is  easy  to  show  these  properties  of  the  relative  product: 

(rs)t  =  r(st) 
t(sVt)  =  tsVrt 
(rVs)t  =  rsVrt 
r(sAt)  C  rsAtt 
(r  A  s)  t  C  rt  A  St 
3-(ts)  3(Rra:r  ALm:s) 
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(t-l)-l 

(ts)"^ 

(r"*)^  = 

j,in  j,  n  Q 

T 

r  t""^  = 


(s"l) (r'^) 

{m,n>0) 

(m,n^O,  ot  t6bun) 
j,ra+n  (tebun) 

t“^r  =  r®  {t6bun) 


7 .  Structures 

7. 1  initial  and  terminal  members 

Suppose  R  is  the  relation  represented  by  the  following 


•j  1 

Since  a=R:g  and  g=R”-‘‘:a  it  can  be  seen  that  R”-*’  follows  an  arrow 
and  R  goes  against  an  arrow.  Now,  notice  that  the  left  and  right 
members  of  R  are: 

Lm:R  =  {a,  b,  c,d,  e,  f,g} 

Rm;R  =  {  g ,  f ,  e ,  d ,  i ,  h  } 


We  define  the  initial  members  of  R  to  be  those  members  which  are 
not  pointed  at  by  an  arrow.  Therefore,  the  initial  members  of  R 
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aie  the  lett  membets  that  ate  not  tight  membets. 

_ ^ 

init:R  =  {Lm-Rm):R  =  {a,  b,  c} 

The  tetminal  membets  ot  a  relation  do  not  point  to  other  membets: 

tetm:R  =  (Rm-Lro):R  =  {h,  i} 

When  a  relation  is  used  to  represent  a  data  structure,  the  above 
functions  become  important. 

For  instance,  a  sequence  is  represented  by  a  relation  with 
the  structure: 


In  this  case  init:S  is  the  unit  class  containing  the  first  ele¬ 
ment  of  the  sequence  (i.e,,  aj^)  and  tetm:S  is  the  unit  class  con¬ 
taining  the  last  element  of  the  sequence  (i.e.,  a^) .  Similarly, 
S>(-init;S)  is  the  sequence  with  its  first  element  deleted. 
Hence,  the  following  common  sequence  manipulation  functions  can 
be  defined: 


oc:S 

=  9  init:  S, 

"first" 

au  S 

=  0  term:  S, 

"last" 

n:S 

=  S> (-init:S)  , 

"final" 

A:S 

=  (-term:S)<S, 

"initial 

where  0  selects  the  element  of  a  singleton  set  (©=(=  )). 


More 
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opetations  on  sequences  ate  discussed  in  the  next  section. 


As  another  example  ot  the  use  or 
a  relation  T  reptesentinq  a  tree, 
or  the  tree,  and  tetm:T  is  set  o£  the 
result  is  analogous  tor  totests. 
toots  is  init;F  and  the  set  or  leaves 


'init*  and  'term',  consider 
Then,  0  init:  T  is  the  root 
leaves  of  the  tree.  The 
Given  a  forest  F  the  set  of 
is  term:F. 


7. 2  higher  level  opetations 

The  set  of  nodes  directly  descended  from  n  is  just  F'-^-tn. 
For  instance,  the  set  of  nodes  directly  descended  from  a  root  is 
F“^linit:F.  and  the  set  of  nodes  that  point  to  leaves  is 
F ! tetm;F. 


These  opetations  can  be  used  for  obtaining  the  maximum  and 
minimum  of  sets.  Suppose  '<'  is  the  less-than  relation  on 
integers  and  S  is  some  set  of  integers.  Then  <SS  is  the  less 
than  relation  restricted  to  this  set,  i.e.  it  is  a  sorting  of  the 
set.  Now  note  that  oc:  (<»S)  and  uk  (<»S)  ate  the  minimum  and  max¬ 
imum  elements  of  the  set: 


min :S  »  oe:  {<SS) 
max:S  »  uk  (<#S) 


These  opetations  ate  only  defined  if  S  has  two  or  mote  elements, 
since  an  irreflexive  relation  cannot  relate  less  than  two  ele¬ 
ments.  That  is,  an  irreflexive  relation  when  restricted  to  a 
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unit  ot  empty  class  becomes  the  empty  relation.  Notice  that  we 
can  select  the  maximum  and  minimum  based  on  any  relation  that  is 
a  ser ies  (i.e.,  transitive,  irreflexive  and  connected). 


The  following  ate  simple  properties  ot  these  operations! 


init:r  =  term: (t“^) 
tetm:t  =  init:(t“-^) 


init: ( t»s) 


8.  Sequences 


terra: ( r“^»s) 


In  this  section  we  will  continue  the  discussion  of  sequences 
begun  in  the  last  section.  We  saw  that  it  was  easy  to  detine  the 
selector  functions  on  sequences  (oc,  oi,  A,  H)  .  This  provides  us 
with  functions  for  taking  sequences  apart.  We  will  define  the 
ordinal  couple  ot  pair ,  which  puts  them  together.  It  x  and  y  are 
two  objects,  then  'x,y'  is  the  relation  that  relates  x  and  y  but 
no  other  objects. 


{x,y)  = 


Observe  that 


oc;  (x,y) 
UK  (x,y) 
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It  will  occasionally  be  convenient  to  write  ordinal  couples  in  a 
vertical  rormat: 

(5)  =  (x,y) 

The  class  of  all  the  ordinal  couples  (or  pairs)  that  can  be  made 
from  the  classes  S  and  T  is  SXT: 

pe(SXT)  3-xytxes  A  ye?  A  p=(x,y)  ] 

8 . 2  catenation  and  consing 

If  s  and  t  ate  sequences  then  we  can  define  an  operation  's^t', 
which  is  the  catenation  of  s  and  t.  To  form  this  catenation  we 
must  hook  the  last  element  of  s  to  the  first  element  of  t: 

s^t  =■  s  V  (ui:s,  oc:t)  V  t 

The  catenation  operation  is  only  defined  for  sequences,  which  are 
required  to  have  at  least  two  elements  (since  an  irreflexive 
relation  with  less  than  two  elements  is  the  empty  relation). 

How  do  we  add  a  single  element  to  the  left  or  right  of  a 
sequence?  The  "cons  left"  and  "cons  right"  operations  are  easy 
to  define: 


X  cl  s  *  (x,  oc:s)  V  s 
s  cr  y  »  s  V  (ui:s,  y) 


It  is  easy  to  show  that  if  s  is  a  sequence,  then: 
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cc:  (x  cl  s)  *  X 

n: (x  cl  s)  =  s 

UK  (s  ct  y)  =  y 

A: (s  cry)  =  s 


Also,  it  s  is  a  sequence,  then  s  V  (uks,  oc:s)  is  a  ring  formed  by 
joining  the  last  element  of  s  to  the  first  element. 

If  s  is  a  sequence,  then  s“^  is  the  reverse  of  s.  Hence, 


oc;s  = 

UK  s”^ 

-1 

UKS  = 

cc;s 

A:  s”^ 

=  (U: 

ii:s”^ 

»  (A; 

(s-t)”^  = 

(x  cl  s)“^  =  s"^  ct  X 

(s  ct  x)“^  =  X  cl  s“^ 


(x,y)"^  =  (y,x) 


®  ®  '  (1) 


9.  Binaty  Operations 

In  this  section  we  will  discuss  out  approach  to  binary 
operations  —  that  is,  to  functions  with  two  arguments  and  one 
result.  We  have  already  seen  how  unary  functions  ate  connected 


14 


to  relations.  For  instance,  we  can  write  the  fact  that  y  is  the 
sine  of  x  by  either;  'y  sin  x*  or  y  ®  sin:x.  Since  we  only  deal 
with  binary  relations,  we  will  have  to  have  a  new  convention  for 
handling  binary  functions.  This  convention  is:  we  will  combine 
the  two  arguments  of  an  operation  into  a  pair.  For  instance,  we 
can  define  a  relation  'sum*  such  that 

X  sum  (y,z) 

if  and  only  if  x  is  the  sum  of  y  and  z.  We  can  use  our  colon 
convention  as  usual,  e.g., 

X  =  sum: {y,z)  <->  x  sum  (y,z) 

Now,  it  would  be  inconvenient  to  have  to  invent  names,  such  as 
'sum',  for  each  operation,  such  as  Hence,  we  will  adopt  a 

systematic  convention  for  making  such  names:  either  placing  the 
conventional  infix  symbol  for  the  operation  in  bold  face  or  in 
parentheses.  For  instance, 

x+(y,z)  X  =  +; (y,z)  ^  x  =  y+z 

X  (+)  (y,z)  <-»  X  »  (+);(y,z)  x  =  y+z 

This  notation  will  permit  us  to  manipulate  in  a  more  regular 
fashion  the  usual  arithmetic  operations  (+,  -,  *,  /)  as  well  as 
the  relational  operations  (A,  V,  etc).  For  instance,  if  S  is  a 
class  of  classes,  then 


) 


(A)  l :SXS 


is  the  class  ot  all  paitwise  intet sections  ot  membets  o£  S 


It  is  cLten  convenient  to  be  able  to  generate  simple  rela¬ 
tions  rrom  a  binary  operation.  Following  Russell  and  Whitehead, 
let  1  represent  any  binary  operation.  We  define: 

x(ii2)y  »  X  =  ywz 

X  (yiT)  2  X  =  ywz 

Hence , 

x(-l)y  »  X  =  y-1 

therefore  (-1)  is  the  predecessor  relation.  These  can  be  used  as 
functions; 

(-l):x  =  x-1 

(+l):x  =  x+1 

This  convention  makes  it  very  easy  to  form  more  complex  func¬ 
tions.  For  instance,  if  we  want  f:x  =  sin:(l/x)  then  we  can 
define  f  =•  sin(l/)  To  see  that  this  works; 

f;x  =  [sin(l/)]:x  »  sin;[  (1/) ;x  ]  *  sin;[  1/x  ] 

10.  Combinator s 

In  this  section  we  will  discuss  several  powerful  operations 
for  manipulating  relations.  These  are  called  combinator s  because 
of  their  similarity  to  the  combinators  of  Curry  and  Feys  [4]. 


Fitsc  observe  the  action  ot  the  (x,)  and  ( ,y)  functions: 

(x,);y  »  (x,y) 

(,y):x  =  (x,y) 


Now  note  that 


f(x,):  y  »  f:[(x,):y]  =»  f:(x,y) 

In  general,  if  f  is  a  binary  function,  then  f(x,)  and  f(,y)  are 
the  "partially  instantiated"  unary  functions.  This  is  the  effect 
of  Cutty  and  Feys  "B"  combinatot  [4]  ,  the  elementary  compositor . 

If  ':R  =»  R“^  then,  since  is  the  reverse  of  a  sequence, 

w*  is  the  reverse  form  of  an  operation.  For  instance,  (-) '  is 
the  reverse  subtract  operation: 

(-)':(x,y)  =  (x,y) ) 

»  (-):(yrX) 

»  y-x 

Thus  (-) '  can  be  read  "subtract  from"  and  (/) '  can  be  read 
"divide  into".  This  is  Curry  and  Feys  "C"  combinator,  or  elemen¬ 
tary  petmutator . 

The  next  combinatot  we  will  discuss  is  the  paralleling  of 
relations,  §>,  which  is  defined: 


(v)  @  5)  ^  ^ 
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So,  it  t  and  g  ate  tunctions, 

(l)  =  (?)  ■  (li?) 

Hence,  ^  is  the  element-wise  combination  ot  f  and  g. 

Another  combinatot  is  the  elementary  duplicator ,  W, 
so  that 


defined 


(W: f ) :  X  »  f ; (x ,x) 


It  we  define  A:x  =  (x,x)  then  it  is  easy  to  see  that  W:f  is  just 
tA*  Fot  instance,  (x)A  is  the  squaring  function: 


(x)A:n 


(x)  :  Asri)  “  (x):(n,n)  =  nxn  »  n^ 


It  should  be  clear  that  Backus'  [f,g]  combining  form  is  just  our 
since 

•  UiiS) 

Since  this  combination  is  so  common  we  will  adopt  a  special  nota¬ 
tion  for  it: 


k\ 


f 

1 


A 


Hence,  ||  :x  -  (||*) 


Some  of  the  properties  satisfied  by  these  combinators  are: 
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R  T 

s  0- 

_  RT 
"  STT 

/R\  n 

_  r" 

■  i" 

_  RTI 

STi 

R  Tl 

s  rs\ 

sol 

-  ^ 

(I)' 

-  1  -  ■( 

'  i 

■  1 

*  R>(Rm:g) 

=  SXRmif) 

R  _ 

s  " 

.  ftae  1 

sar| 

cl 

-  n*' 

cr 

As  an  example  o£  these  combinatots  it  is  easy  to  show  that 

f  =  (+)-^^ 

is  the  function  f:t  *  t^+2t. 


Our  last  combinatot  is  the  meta-*application  operator  ,  ::f 

which  corresponds  to  Curry  and  Feys*  S  combinatot: 

(f : :g) :x  *  (fix) : (g:x) 

For  instance,  [({)'] ::init  is  the  operation'  that  gives  the  set  of 
descendants  of  roots  of  a  forest,  F,  since 
([  ( 1 )  '  ]  : :  init)  :F  ■  (F"^  I )  :  ( init ;F)  . 


a 


An  attay  is  just  a  tunction  from  a  contiguous  subset  o£  the 
integets  to  some  set  o£  values.  It  A  is  an  attay  and  i  Rm  A  then 
A:i  is  the  i-th  element  ot  A.  Similarly,  it  I  C  Rm:A  is  a  set  of 
index  values  then  A!:I  is  the  corresponding  set  of  array  values 
and  A>I  is  the  subarray  of  A  selected  by  those  indices. 

It  is  easy  to  define  multi-dimensional  arrays:  they  are 
just  arrays  whose  elements  are  selected  by  sequences  of  integers, 
e.g.  M:(i,j).  If  M  is  a  two-dimensional  array,  then  M(i,)  is  the 
i-th  tow  of  M  and  M(,j)  is  the  j-th  column  of  M.  Also,  if  I  is  a 
set  of  row  indices  and  J  is  a  set  of  column  indices  then  M>(lXJ) 
is  the  submatrix  of  M  selected  by  these  sets.  It  is  easy  to  see 
that  M'  is  the  transpose  of  M,  since 

M':(i,j)  =  M:(’:{i,j))  =  M:(j,i) 

Mote  generally,  if  P  is  a  permutation  function  (i.e.  a  bijection 
from  an  index  set  into  itself)  then  AP  is  the  result  of  permuting 
A  by  P. 

Suppose  X  is  an  element  of  the  array  A  (i.e.,  for  some  i, 
x=A:i).  Then  A:x  is  the  set  of  all  indices  for  which  x=A:i. 
Therefore  we  can  find  the  index  of  the  first  occurence  of  x  in  A 
(i.e.  APL's  iota  operator)  by  minA:x.  In  general,  if  P  is  some 
property  (i.e.  class),  then  A'^ltP  is  the  set  of  indices  of  all 
elements  of  A  that  satisfy  P.  A  sorted  reflexive  sequence  of 
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these  indices  is  just  <  S  (A“^!:P) 

It  is  easy  to  convert  arrays  to  sequences  and  vice  versa. 
Suppose  all  the  elements  or  A  are  distinct,  then  A"^  is  a  func¬ 
tion  that  returns  the  index  of  an  element  of  A.  We  want  to 
define  a  sequence  S  such  that  xSy  if  and  only  if  x  proceeds  y  in 
A,  i.e.  the  index  of  x  is  one  less  than  the  index  of  y. 


xSy  » 

(A"^;X) 

=  (A"^:y)-1 

(A“^:x) 

(-1)  (A"^:y) 

x[Al (-1) 

1  A“^  ]  y 

Hence,  S  =  A(-1)A"^. 

We  will  finish  our  discussion  of  arrays  by  investigating  the 
generation  of  sorted  arrays.  Let  S  be  a  set  of  integers  to  be 
sorted,  then  £SS  is  a  structure  which  relates  lesser  elements  to 
greater  elements.  Now  if  x  is  any  element  of  the  set,  (£#s') : x  is 
the  set  of  all  elements  less  than  x.  Thus  [#  (<_#s')  ]  :x  is  the 
number  of  elements  of  S  less  than  or  equal  to  x.  This  is  just 
the  index  of  x  in  the  sorted  array  we  seek.  Hence  if  A  is  the 
sorted  array,  xAi  if  and  only  if  i  [#  (£85)  ]  x ,  so  A  =»  [#  (£Xs')  ]  . 

Of  course  this  can  be  generalized  to  any  ordering  relation. 

12.  Scanning  Structures 

It  is  often  useful  to  scan  a  structure  while  performing  some 
processing  at  each  node.  When  the  data  structure  is  a  sequence 
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this  amounts  to  APL's  reduce  operator  and  Backus'  insert  opera¬ 
tor.  We  will  detine  a  scanning  operation  that  works  on  a  more 
general  class  ot  structures.  This  operator  can  be  understood 
intuitively  as  follows:  The  state  of  the  scanning  process  is 
represented  by  a  set  of  "read  heads"  each  of  which  is  "positioned 
over"  a  node  and  holds  state  information  accumulated  from  the 
nodes  it  has  already  visited.  A  node  can  be  processed  when  a 
read  head  has  moved  to  that  node  over  each  edge  which  leads  into 
the  node.  When  this  occurs  a  processing  function  is  applied  to 
the  node  (as  first  parameter)  and  the  union  of  the  state  infotraa- 
tion  of  each  ot  the  read  heads  (as  second  parameter).  The  result 
or  this  processing  step  becomes  the  state  information  associated 
with  a  new  set  of  read  heads  which  are  advanced  along  each  edge 
leading  out  from  the  node.  The  processing  of  the  structure  is 
completed  when  all  read  heads  have  arrived  at  terminal  nodes 
(hence  this  scanning  operation  is  not  defined  for  cyclic  struc¬ 
tures).  Scanning  a  structure  is  started  by  positioning  a  read 
head  with  initial  state  information  over  each  initial  node. 

The  scanning  operation  is  symbolized  by  fli,  where  f  is  the 
processing  function  and  i  is  the  initial  state  for  the  read 
heads.  For  instance,  it  V  is  a  vector,  (+)I0:V  will  scan  the 
elements  of  V  using  (+)  (i.e.  APL  +/V  ot  Backus'  (/+) :V) .  For  a 
mote  interesting  example,  suppose  T  is  an  attributed  parse  tree, 
E  is  a  function  that  evaluates  attributes  and  B  is  the  initial 
set  or  attribute  bindings.  Then  ClB:T  propogates  the  values  of 


i 
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inherited  attributes  down  to  the  leaves  ot  the  tree.  Conversely, 
ElB:(T“^)  propogates  the  values  of  synthesized  attributes  back  to 
the  root.  Hence,  repeated  applications  of  ElB  and  (ElB) '  will 
evaluate  all  of  the  attributes.  Of  course,  this  program  will 
work  just  as  well  if  T  is  a  forest  of  parse  trees.  The  I  opera¬ 
tor  is  still  undergoing  evaluation  as  it  is  one  of  several  possi¬ 
ble  structure-directed  scanning  operations. 

13.  Examples 

In  this  section  we  will  give  several  examples  of  relational 
programs. 

PAYROLL  EXAMPLE:  Suppose  we  have  a  file  $  of  employee 
records,  where  r  =  J:n  is  the  record  for  the  employee  with  the 
employee  number  n.  We  will  suppose  that  employee  records  are 
functions  defined  so  that: 

r  :N  =>  employee  name 

r;H  =  hours  worked  so  far  this  week 

r:R  =  pay  rate 

We  are  given  an  update  file  U  such  that  U:n  is  the  number  of 
hours  worked  by  employee  n  today.  We  wish  to  generate  a  new  pay¬ 
roll  file  . 

SOLUTION:  Let  r  »  |:n  and  r'  =  |':n  be  the  old  and  new 
employee  records.  It  is  clear  that  r'  is  the  same  as  r  except 


23 


rot  its  H  tield.  In  otdet  to  raodity  part  or  a  relation,  we  will 
use  the  Md  runction  detined  by: 

Md;(S,R)  =  R  V  S>(-Rm:R) 

Then,  if  h'  represents  the  new  value  of  the  H  field,  the  new 
employee  recotd  is 


r'  »  Md:  (i  ,  (h'  ,H)) 

Now,  h*  is  just  the  cumulative  hours  worked: 

h'  =  (§:n) :H  +  U:n 


By  combining  these  results  we  have. 


i':n  a  r'  *  Md;(  fi:n,  (h',H)) 


To  find  f  we  must  factor  out  the  employee  number  n.  To  do  this, 
note  that  (5:n) :H  =  (:H):(§:n)  =  (:H)5:n.  That  is,  (:H)5  is  a 
slice  of  the  payroll  file:  the  hours  worked  for  each  employee. 
Theretore , 


h'  =  (|:n):H  +  U:n 


( ;H) t :n  +  U:n 


;n 


Now,  define  the  updating  function  u  by 

u:n  a  (  :n,  H  )  =  ( ,H)  (  +  ;n 

Then,  |':n  =  Md:(5;n,u:n)  =  Md^  : n.  Ther for e ,  the  solution  to 
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out  problem,  the  new  payroll  file,  is 

I'  =  Mdjj 

where  u  »  (  ,h)  (»)  ^ 

CHECK  ISSUING  EXAMPLE:  Suppose  we  wish  to  take  the  payroll  file 
from  the  previous  example  and  generate  checks  for  the  employees. 
We  will  assume  that  a  function  C  is  available  such  that  C:(nm,p) 
returns  a  check  in  the  amount  p  made  out  to  the  name  nm. 

SOLUTION:  We  will  ignore  overtime  computations.  Hence,  if  n 
is  an  employee  number  then  (:n:N  is  his  name  and 


If  we  just  want  a  set  of  checks,  this  is  Lm:F. 


14. 
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Implementation  Notes 

The  ptimaiy  goal  of  out  investigation  has  been  to  determine 
if  relational  programming  is  significantly  better  than  conven¬ 
tional  methods.  It  would  be  premature  to  devote  much  effort  to 
implementation  studies  before  it  is  even  determined  if  relational 
programming  is  an  effective  programming  methodology.  However ,  a 
brief  discussion  of  implementation  possibilities  is  probably  not 
out  of  line. 

The  most  obvious  representation  of  a  relation  is  the  exten- 
sional  representation,  in  which  all  the  elements  of  a  relation  or 
class  are  explicitly  represented  in  memory.  There  are  many  kinds 
of  extensional  representations,  such  as  hash  tables,  binary  trees 
and  simple  sorted  tables.  Of  course,  performance  can  be  improved 
through  the  use  of  associative  memories  and  active  memories  (in 
which  each  memory  cell  has  a  limited  processing  capability). 

Some  relations  and  classes  will  be  so  large  that  it  is 
uneconomical  to  represent  them  explicitly  in  memory.  In  these 
cases  an  intensional  representation  should  be  used.  Here  a  class 
or  relation  is  represented  by  a  formula  or  expression  for  comput¬ 
ing  that  relation  or  class.  Operations  on  the  class  or  relation 
are  implemented  as  formal  operations  on  the  expression.  This  is 
feasible  because  of  the  simple  algebraic  properties  satisfied  by 
relations.  It  can  be  seen  that  an  intensional  representation  is 


really  just  a  variant  of  a  lazy  evaluation  mechanism 


Sometimes 
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an  intensional  representation  is  necessary;  for  instance,  rela¬ 
tions  or  infinite  cardinality,  such  as  the  numerical  operators 
and  relations,  require  an  intensional  representation. 

Although  the  programmer  could  be  allowed  to  choose  between 
extensional  and  intensional  representations  for  his  relations, 
this  is  not  necessary.  It  is  probably  feasible,  and  certainly 
higher  level,  to  have  the  system  choose  representations  on  the 
basis  of  cardinality  estimates  of  the  classes  and  relations 
involved.  The  algebra  of  relations  is  regular  enough  that  many 
of  these  decisions  can  be  made  at  compile  time.  Any  that  can't 
can  be  deferred  to  run-time  when  exact  cardinality  information  is 
available. 

15.  Conclusions 

Of  course,  we  are  not  the  first  to  propose  introducing 
aspects  of  a  relational  calculus  into  programming.  Space  limita¬ 
tions  prohibit  a  comparison  with  previous  work,  such  as  that  by 
Feldman  and  Rovner  [51,  Codd  [3]  and  Childs  [2].  What  does  dis¬ 
tinguish  this  investigation  is  the  exclusive  use  of  relations  in 
a  general  purpose  programming  language.  It  is  hoped  that  the 
preceeding  discussion  has  made  plausible  some  of  the  advantages 
claimed  for  relational  programming  in  the  Introduction.  Consid¬ 
erable  work  remains  to  be  done  in  evaluating  the  effectiveness  of 
a  relational  calculus  as  a  programming  tool.  For  instance,  the 


optimum  set  of  combinatois  and  relational  operators  must  be 
selected.  Another  non-trivial  problem  is  the  selection  of  a  good 
notation  for  the  relational  calculus.  More  from  convenience  than 
conviction  we  have  used  the  notation  of  [8]  and  [1].  Making 
relational  programming  an  effective  tool  will  require  designing  a 
notation  that  combines  readability  with  the  manipulative  advan¬ 
tages  of  a  two-dimensional  algebraic  notation.  This  is  all  prel¬ 
iminary  to  any  serious  considerations  of  software  or  hardware 
implementation  techniques.  The  reader  who  is  interested  in  more 
details  about  programming  in  a  relational  calculus  should  consult 
[7]. 
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